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JOINT SYMBOL, AMPLITUDE, AND 
RATE ESTIMATOR 

Inventor: 
Thomas P. McElwain 

RELATED APPLICATIONS 

[0001] This application claims the benefit of U.S. Provisional Applications 
No. 60/462657, filed April 14, 2003. This application is also related to U.S. Patent 
application Serial No. 10/105,918, filed March 25, 2002 entitled "System for Decreasmg 
Processing Time in an Iterative Mvdti-User Detector System" (D4608); U.S. Patent 
Application Serial No. 10/414,738 filed April 16, 2003 entitled "System and Method for 
Ino^asing Throughput in a Multiuser Detection Based Multiple Access Communications 
System" (D4615); and U.S. Patent Application Serial No. 10/208409 entitled "Power and 
Confidence Ordered Low Complexity Soft TurboMUD with Voting System" (D-4627US). 
Each of these ^^plicadons is herein incorporated in their entirety by reference. ^ 

FIELD OF THE INVENTION 

[0002] The present invention relates to coinmimications and more particularly to co- 
channel communications, and . more particxilarly to increasing capacity by allowing 
interfering channels. 

BACKGROUND OF THE INVENTION 

[0003] The telecommunications industry has been expanding at an unprecedented growth 
rate throughout the World. In particular, the wireless sector, including cell phones, PCS, 
wureless local area networks and Bluetooth such as IS95, GSM, 3G, IEEE 802.1 la^/g, and 
802.16 has grown far beyond expectations and at a much higher rate than the fixed 
telecommunications (wired) coxmterpart. The ability to access data and communicate 
anywhere at anytime has enormous potential and commercial value. 
[0004] The content of the wireless sector is also changing, with more and more data being 
transmitted, including Internet connectivity and live feeds. The usage involving persor^al 
digital assistants (PDA's) and even smart appliances have created new markets utilizing 
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wireless data communications. Despite advancements in wireless transmission and 
reception, there is a growing problem of extracting more information signals within a 
limited bandwidth. 

[0005] Emerging multiple-access receiver processing procedures allow for multiple users 
to access the same communications medium to transmit or receive information. Mxiltiple 
access conamunication systems allow the transmission of miiltiple digital data streams 
between multiple transmitting and receiving devices. However, since many users transmit 
energy on the same communications channel, a number of inherent difficulties arise, 
particularly when receivers attempt to detect the information associated with a particular 
user when there is heavy signal interference created by other users of the system at the 
same time. Typically the signal of interest caimot be received or the quality of reception is 
significantly degraded. 

[0006] For example, a base station that processes a number of cellular devices has to 
receive and transmit data within a certain frequency range. The ability to extract the 
correct *data froSi a given tiser is a difficult task when the effects of interference 2ind 
multipaths are considered. The problem is further complicated when the number of users 
exceeds the number of dimensions (e.g. time slots, frequency slots, polarizations, etc), 
resulting in an overloaded condition. 

[0007] In addition to the problems associated with multiple users in a given bandwidth, an 
additional problem is the inability to process the data in the receivers in real time. 
Advanced receiver techniques cover several areas, namely interference suppression (also 
called Multi-User Detection or MUD), multipath combining and space-time processing, 
equalization, and channel estimation. These various techniques can be mixed and matched 
depending upon the circumstances. 

[0008] One way of alleviating some of the multiple access problems is to separate the 
interfering transmissions at the receiver using signal processing techniques. However, 
state of the art receivers are not capable of detecting and decoding flie information 
associated with each user imder conditions of heavy interference. Anotiier solution to the 
co-channel interference problem is to decrease the number of users per channel. This, of 
course, is not an attractive option for telecommunication companies, since obtaining the 
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maximum number of users or managing peak volume transmission periods are important 
business objectives. 

[0009] It should be understood that the discussion herein illustrates wireless cellular 
conunimications the multiple access topologies are equally applicable to wired cable 
systems and local area networks, read/write operations of a disc drive, satellite 
communications and any application that benefits from extracting digital information from 
among many multiple interfering signals. 

[0010] Several techniques are used to improve results in co-chaimel multiple access 
communications systems. Frequency-Division Multiple Access (FDMA) assigns a 
different firequency to each user and parses an allocated band for a communication system 
wherein a single iiser's signal transmission power is concentrated into a single narrower 
radio firequency band. Interference from adjacent chaimels is limited by the use of band 
pass filters, however for each channel being assigned a different frequency system the total 
capacity is limited by the available firequency slots and by physical limitations imposed by 
frequency reuse. In a celliilar telephone configuration this poses problems because all 
proximate cells must operate on different frequencies. However, frequency bands may be 
re-used, provided that the same firequency cells are positioned at a ceftaiii distance apart. 
A fiirther drawback with FDMA schemes is that users will pay fiill-time for their assigned 
frequency regardless of thehr actual use of the system. 

[0011] Code Division Multiple Access (CDMA) is another multiplexing technique 
wherein for each communication channel the signals are encoded using a sequence known 
to the transmitter and the receiver for that specific channel. In CDMA, all users use the 
same frequency at the same time. However, before transmission, the signal from each user 
is multiplied by a distinct signature waveform. The signature waveform is a signal that has 
a larger bandwidth than the information-bearing signal from the user. However, in a 
CDMA system, the total level of co-channel interference limits the number of active users 
at any instant of time. 

[0012] In Time Division Multiple Access (TDMA) technology, mxiltiple channels of data 
are temporally interleaved, i.e. each signal is assigned to a different time interval and the 
signals are transmitted individually, according to their assigned time slot. The TDMA 
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channel consists of a time slot or frame in a periodic train of time intervals over the same 
frequency, with a given signal's energy confined to one of these time slots. However, in a 
TDMA system, all transmitters and receivers must have access to a common clock, as 
time-synchronization among the users is required. Adjacent chaimel interference is limited 
by the use of a time gate or other synchronization element that only passes signal energy 
received at the proper time. The system capacity is limited by the available time slots 
(within a given frequency band) as well £is by physical limitations imposed by frequency 
reuse, as each channel is assigned a different time slot within a particular frequency band. 
[0013] One of the goals of FDMA and TDMA systems is to try and prevent two 
potentially interfering signals from occupying the same frequency at the same time. In 
contrast. Code Division Multiple Access (CDMA) techniques allow signals to overlap in 
both time and frequency. CDMA signals share the same frequency spectrum at the same 
time, hence, the CDMA signals appear to overlap one another. The scrambled signal 
format of CDMA vutually eliminates cross talk between interfering transmitters. 
[0014] In a CDMA syistem, each signal is transmitted, using spread spectrum techniques. 
The transmitted informational data stream is impressed upon a much higher rate data 
stream termed a signature sequence. The bit stream of the signature sequence data is 
typically binary, and can be generated using a pseudo-noise (PN) process tibat appears 
random, but can be replicated by an autiiorized receiver. The informationaLl data stream 
and the high bit rate signature sequence stream are combined by multiplying the two bit 
streams together, assuming the binary values of the two bit streams are represented by +1 
or -1. This combination of the higher bit rate signal vnth the lower bit rate data stream is 
called spreading the informational data stream signal. Each informational data stream or 
channel is allocated a imique signature sequence. 

[0015] In operation, a stream of spread information signature signals are modulated by 
weights corresponding to the information that is to be transmitted. Some modulation 
examples include binary phase shift keying (BPSK) and quadrature phase shift keying 
(QPSK). If several transmitters modulate then: data onto the signature waveform and 
modulate again with Hie carrier tone, a radio frequency (RF) signal comprised of a 
continuous stream of information modulated signature pulses vsdll be present at the 
receiver, one corresponding to each transmitter. The plurality of transmitted signals and are 
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jointly received as a composite signal at the receiver. Each of the spread signals overlaps 
all of the other spread signals in time and frequency. Moreover, environmental noise as 
well as receiver electronic noise is also present in the measiired received signal. The state 
of the art receiver correlates the composite noisy signal with one of the unique signature 
sequences, and the corresponding information signal is isolated and despread while the 
other signals appear as only small additions to the noise floor. 

[0016] A signature sequence is often used to represent one bit of information* Receiving 
the transmitted sequence or its complement indicates whether the information bit is a +1 or 
-1, sometimes denoted "0" or "1". The signature sequence usually comprises N pulses, and 
each pulse is called a "chip". The entire N-chip sequence, or its complement, depending 
on the information bit to be conveyed, is referred to as a transmitted symbol. 
[0017] The receiver correlates the received signal with the complex conjugate of the 
known signature sequence to produce a correlation value. When a 'large' positive 
correlation results, a "0" is detected, and when a * large' negative correlation results, a "1" 
is detected. 

[0018] It should be understood that the information bits could also be coded bits, where the 
code is a block or ponvolutional code. Also, the signature sequence can be much longer . 
. than a single transmitted symbol, in which case a subsequence of the signature sequence is 
used to spread the information bit. 

[0019] The prior systems do not properly accoimt for the real world mobile 
commimication signals that suffer from signal degradation such as interference and 
multipath problems. The systems of the state of the art generally tended to make 
assumptions that all other interferers and multipaths were additive white Gaussian noise. 
However, this assumption is not accurate for co-channel interference and mxiltipaths. 
[0020] Multipath dispersion occurs when a signal proceeds to the receiver along not one 
but many paths so that the receiver encoimters echoes having different and randomly 
varying delays and amplitudes. The receiver receives a composite signal of multiple 
versions of the transmitted symbol that have propagated along different paths, called rays, 
having different relative time. Each distinguishable ray has a certain relative time of 
arrival, a certain amplitude and phase, and as a result, the correlator outputs several smaller 
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spikes. RAKE receivers are well known and attempt to *rake' together all the 
contributions to detect the transmitted symbol and recover the information bit. 
[0021] Conventional RAKE receivers provide satisfactory performance for operation in 
the presence of multipath vmder ideal conditions however the signature sequence must be 
uncorrelated with time shifted versions of itself as well as variovis shifted versions of the 
signature sequences of the other CDMA signals, Co-channel interference refers to signals 
received from other users either directly or reflected. If one received signal corresponding 
to the signature sequence of interest has a non-negligible cross correlation with the 
received signal originating from another transmitter (a co-channel interferer), then the 
value measured at the receiver, e.g. the correlation value for flie signal of interest, is 
corrupted. In other words, the correlation computed at the receiver that would be used to 
decode a particular signal of interest is overwhelmed by an interfering signal; this is 
referred to as the near-far problem. The interference caused by an echo of one transmitted 
symbol overlapping with the next transmitted symbol might also be non-negligible. If this 
is the case, the transmitted symbols interfere with past and fixture transmitted symbols. 
This is commonly referred to as intersymbol interference (ISI). In actuality, performance 
is degraded both by cb^channel interference and ISI. 

[0022] There has been much research to address signal interference with known multipath 
time dispersion. This is termed joint deniodulation with no multipath and is fiirther 
described m S. Verdu, "Mimmum Probability of Error For Asynchronoiis Gaussian 
Multiple-Access Channels," IEEE Trans; Info. Theory, Vol. IT-32, pp. 85-96, R. Lupas 
and S. Verdu, "Linear multiuser detectors for synchronous code-division multiple-access 
channels," IEEE Trans. Inform. Theory, Vol. 35, pp. 123-136, Jan. 1989; and R. Lupas and 
S. Verdu, *TMear-far resistance of multiuser detectors in asynchronous chaimels," IEEE 
Trans. Commun., Vol. 38, pp. 496-508, Apr. 1990. 

[0023] There are a host of approaches for jointly demodulating any set of interfering 
digitally modulated signals, including multiple digitally modulated signals. Maximum 
Likelihood Sequence Estimation determines the most likely set of transmitted information 
bits for a plurality of digital signals without multipath time dispersion. The maximum 
likelihood joint demodulator is capable, in theory, of accommodating the largest number of 
interfering signals, but has a prohibitive computational complexity that makes it 
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unrealizable in practice. The decorrelation receiver is another, less computationally 
complex receiver processing approach that zeroes out or decorrelates the different signals 
so that they no longer interfere with one another. The decorrelator as well as virtually 
every other lower complexity joint demodulator, is not capable of operation when the 
number of signals is over a set threshold which falls significantly short of the theoretical 
maximum. 

[0024] In a real world multi-user system, there are a nimiber of independent users 
simultaneously transmitting signals. These transmissions have the real-time problems of 
multi-patii and co-channel interference, fading, and dispersion that aSect the received 
signals. As described in the prior art, multiple user systems communicate on the same 
frequency and at the same time by utilizing parameter and channel estimates that are 
processed by a multi-user detector. The output of the optimal multi-user detector operating 
within the multiuser capacity limits of the channel is an accurate estimation as to the 
individual bits for an individvial user. 

[0025] Moreover, in an article by Paul D. Alexander, Mark C. Reed, John A. Asenstorfer 
and Christian B. Schlagel in IEEE Transactions on Communications, vol. 47, number^ 7, 
July 1999, entitled "Iterative Multi-User Interference Reduction: Turbo CDMA," a system 
is described in which multiple users can transmit coded information on the same frequency 
at the same time, with the multi-user detection system separating the scrambled result into 
interference-free voice or data streams. 

[0026] Low complexity multiuser detector have been contemplated that use linear 
multiuser detectors to achieve optimal near-far resistance. (Near-Far Resistance of 
Multiuser Detectors for Coherent Multiuser Communications, R. Lupas, S. Verdu, IEEE 
Trans. Commun. Vol. 38, no. 4, pp 495-508, April 1990). While providing certain 
advantages, the performance has not been demonstrably improved. Varanasi and Aazhang 
proposed a multistage technique as described in the article Near-Optimxmi Detection in 
Synchronous Code-Division Multiple Access Systems, IEEE Trans. Commun., Vol. 39, 
No. 5, May 1991. 

[0027] Decorrelatmg decision feedback detectors (DDFD) have been described by A. 
Duel-Hallen m Decorrelating Decision-Feedback Multiuser Detector for Synchronous 
Code-division Multiple Access Channel, IEEE Trans. Commun., Vol. 41, pp 285-290, Feb. 
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1993. Wei and Schlegel proposed soft-decision feedback to suppress error propagation of 
the DDFD in Synchronous DS-SSMA with Improved Decorrelating Decision-Feedback 
Multiuser Detection, ffiEE Trans. Veh. Technol., Vol. 43, pp 767-772, Aug. 1994. Tree- 
type maximum-likelihood sequence detectors were also proposed for multiuser systems as 
were breadth-first algorithms and sequential detection including using the M-algorithm 
tree-search scheme with a matched filter (MP). The prior references also reveal schemes 
that include some form of decorrelating noise whitening filter (WF). 
[0028] Howevfer, one of the primary disadvantages of the prior references implementations 
is the inability to accommodate overloaded conditions. Decision feedback techniques are 
limited in that they are incapable of working in supersaturated environments. Although the 
MMSE-based decision feedback detector can work in a supersaturated environment, it has 
been demonstrated to be too aggressive with hypothesis testing to produce accurate results. 
[0029] Multi-user detection (MUD) refers to the detection of data in non-orthogonal 
multiplexes. MUD processing increases the number of bits available per chip or signaling 
dimension for systems having interference limited systems. A MUD receiver jointly 
demodulates co-pbannel interfering digital signals. Multiuser detectioii systems take full 
advantage of all information available at the receiver, by making use of any "knowledge" 
that the receiver has about the interfering signals. Because the number of users that can be 
packed into a MUD-based multiple access (MA) system is a function of the number of 
independent dimensions over which the set of signals is spread (the dimension of the span of 
&e set of signals), the total number of users in the system can be increased if more 
dimensions are used for transmitting the signals and the same dimensions are accessible at 
the receiver. 

[0030] In addition to expanding the number of dimensions, favorably "spreading*' the 
received signals out over those dimensions can also allow for increases in the number of 
users a MUD-based system can accommodate. For example, typical signaling sets for 
multiuser conamunications do not include as a free parameter the reference amplitude of 
each user. In the IS95 code division protocol, amplitude is controlled completely for 
purposes of power control to meet a signal-to-noise specification (all users ideally being 
received with the same signal-to-noise ratio (SNR)). Therefore the advantages offered to 
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the MUD are not exploited and the aggregate throughput of a multiple access system is 
limited if amplitude is not exploited, 

[0031] There are various multiuser detectors in the art, including optimal or maximum 
likelihood MUD, maximum likelihood sequence estimator for multiple interfering users, 
successive interference cancellation, TurboMUD or iterative MUD, and various linear 
algebra based multi-user detectors such as all of those detailed in the well-known text 
"Multiuser Detection" by Sergio Verdu. In the state of the art, algebraic means are used to 
compute linear operators for the entire set of users (communications channels) 
simultaneously. This is done by utilizing prior information, or knowledge of the likely 
value of each user's bit of information, each at a particular instant in time. This multiuser 
detection processing is described in the text S. Verdu, Multiuser Detection, Cambridge 
Press, 1998. However, this suffers from a significant disadvantage in that it requires 
knowledge of all parameters to perform the processing, 

[0032] Optimal MUD based on the maximum likelihood sequence estimator operates by 
comparing the received signal with the entire number of possibilities that could have 
resulted, one for each bit or symbol epoch. The number of possible measured levels for the 
received signal is exponentially related to the number of users and the duration of the ISL* 
Hence, the optimal processing is a computationally complex, and it is riot possible to 
accomplish in a real-time environment. Thus for those multi-user detectors that exanodne 
the entire space, real-time operation is often elusive. 

[0033] In general, optimal MUD units function by examining a number of possibilities for 
each bit. However, for miilti-user detectors that examine a larger capacity of signal, the 
computations are complex and time-consuming, thus making real-time operation 
impossible. Numerous attempts at reliable pruning of the optimal MUD decision process 
or the use of linear approximation to the replace the optimal MUD have still not produced 
a workable solution for the real world environment 

[0034] There are several suboptimal multiuser detectors that are less computationally 
complex and known in the art. One example of suboptinial detectors, called linear 
detectors, includes decorrelators, minimum mean square error or MMSE detectors, and 
zero-forcing block linear equalizers. The conventional Minimum Mean Squared Error 
(MMSE) Multiuser detector utilizing prior information is described by Wang and Poor in 
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"Iterative (Turbo) Soft Interference Cancellation and Decoding for Coded CDMA", in the 
Transactions on Communications, My 1999. See also Alexander, Reed, Asenstorfer, and, 
Schlegel, "Iterative Multiuser Interference Reduction: Turbo CDMA," IEEE Trans on 
Comm, July 1999; and Poor, "Turbo Multiuser Detection: An Overview" ISSSTA 2000. 
But, linear algebra based MUD (non-iterative) and successive interference cancellation 
fails for cases of overloaded multiple access systems. 

[0035] One example of overloading is whext the number of simultaneous users is doubled 
or tripled relative to existing state of the art Even for underloaded multiple access 
systems, tiie performance of non-iterative MUD and successive interference cancellation 
degrades significantly as the number of users increases, while the computation complexity 
of the optimal MUD increases significantly as the number of users increases. The 
computing problems are so extreme that even the most expensive hardware unbound by 
size and weight can often to keep us witii this overwhelming complex processing 
requirement of optimal MUD. Moreover, an unreasonable delay would be required to 
decode each bit or symbol rendering such a system useless in practice. 
[0036] Reduced complexity approaches based on tree-pruning help to some extent to 
eliminate the improper, bit combination from consideration where, ideally, such a 
procedure should prune out many 'bad' paths in the decision tree but maintain the proper 
path. Thus, the entire tree does not need to be traversed to make the final decision. 
[00371 The M-algorithm is a pruning process that limits the number of hypotheses 
extended to each stage to a fixed tree width and prunes based on ranldng metrics for all 
hypotheses and retaining only the M most likely hypotheses. The T-algorithm prunes , 
hypotheses by comparing the metrics representing all active hypotheses to a threshold 
based on the metric corresponding to the most-likely candidate. Performance of M- 
algorithm based MUD degrades as the parameter M is decreased, but M governs the 
number of computations required. Similar effects are seen for other tree-pruning based 
MUD (T- algorithm, etc). To combat improper pruning, basic tree-pruning must ensure 
that M is "large enough", and therefore still encoimters increased complexity for 
acceptable performance levels when the number of interfering signals and/or ISI lengths 
are moderate to large. 
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[0038] As an illustration of the M-algorithm as a tree-pruning algorithm, consider a tree 
made up of nodes and branches. Each branch has a weight or metric, and a complete path 
is sequences of nodes connected by branches between the root of the tree and its branches. 
When applied as a short cut to the optimal MUD, each branch weight is a function of the 
signature signal of a certain transmitter, the possible bit or symbol value associated with 
that transmitter at that point in time, and the actual received signal which includes all the 
signals from all the interfering transmissions. The weight of each path is the sum of the 
branch metrics in a complete path. The goal of a tree searching algorithm is to try to find 
the complete path through a tree with the lowest metric. With the present invention the 
metrics of multiple complete padas are not calculated. Rather, the metrics of individual 
branches in a tree are calculated in the process of locating one complete path through the 
tree and thereby defines one imknown characteristic of each of the co-channel, interfering 
signals needed to decode the signals. 

[0039] A MUD algorithm within the TurboMUD system determines discrete estimates of 
the transmitted chaimel symbols, with the estimates then provided to a bank of single-user 
decoders (one decoder for each user) to recover the input bit streams of all transmitted 
signals. Two general types of multi-xiser detectors within the TurboMUD system are 
possible, namely those that provide hard outputs, which are discrete values, and those that 
provide soft outputs, which indicate both the discrete estimate and the probability that the 
estimate is correct. In basic terms, turbodecoding refers to breaking a large processing 
process into smaller pieces and performing iterative processing on the smaller pieces until 
the larger processing is completed, and this basic principle was applied to the MUD. 
[0040] However, single-user decoders operating on hard values, or discrete integers, have 
unacceptable error rates when there is a large amount of interference or noise in the 
received signal. The reason is that discrete integers do not provide adequate confidence 
values on which the single-user decoder can operate. These decoders operate better on so- 
called soft inputs in which confidence values can range from -1 to 1, such as for instance 
0,75 as opposed to being either -1 or +1. To provide soft values that can then be utilized by 
a single-user decoder, the multi-user detector chosen for the TurboMUD can generate these 
soft values. The invention described below will work with soft output or a hard output 
MUDs, or a combination of the two. 
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[0041] In general, soft or hard output versions of the optimum maximum likelihood multi- 
user detector (Verdu, Multiuser Detection, Cambridge University Press, 1998) or an M 
algorithm (as described, for instance, in Schlegel, Trellis Coding, IEEE Press, 1997) with a 
moderate to high value of M causes the Turbo MUD to require too many computations to 
keep up with real time transmissions. Using a fast, but inferior, multiuser detection 
scheme such as a linear-based detector or those detailed in the text "Multiuser Detection" 
by Sergio Verdu causes poor quality output when there are many interferers or users. 
[0042] Moreover, when dealing with hand-held communications units such as wureless 
handsets, the amount of processing within the device is limited, directiy Umiting the 
amount of computational complexity that is allowed. In order to provide real-time 
performance both at a cell site and the handset, it therefore becomes important to be able to 
reduce the amoimt of computational complexity and processing time so as to achieve real- 
time performance. 

[0043] A great number of communications and data transfer systems operate at or near a 
full capacity. Conventional receiver performance is imsatisfactory in the presence of co- " 
channel interference. Furthermorp, many receivers require prior knowledge of signal ^ 
parameters such as phase and amplitude of the channel to perform processing functions 
without co-channel interference. -What is needed is a means to increase the number of 
available channels, by reassigning channels to be perhaps slightiy interfering, thereby 
increasing the overall throughput without increasing bandwidth. Such a system should 
provide an efficient means of jointly estimating symbols, channel amplitude, and data rate 
transmitted in a super-saturated communications channel. And, any such invention should 
have the ability to estimate the symbols and data rate, blindly without prior knowledge of 
channel amplitudes and phase. 

BRIEF SUMMARY OF THE INVENTION 

[0044] While adaptable in many forms, one embodiment of the invention is an apparatus 
for processing a digital data stream from multiple users, comprising an initial amplitude 
estimation unit processing the data stream and producing initial amplitude estimates on a 
first iteration. There is a joint amplitude estimator coupled to the data stream and the 
initial amplitude estimator, wherem the joint amplitude estimator produces updated 
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amplitude estimates. A symbol estimator is coupled to the data stream, the initial 
amplitude estimator, and the joint amplitude estimator, wherein the symbol estimator 
produces a plxirality of symbols estimates for each user. There is a bajok of decoders 
coupled to the symbol estimator, producing a plurality of symbol likelihood estimates for 
each user, wherein tiie symbol likelihood estimates are iteratively fed back to the symbol 
estimator and the joint amplitude estimator until a final condition is obtained. 
[0045] Further variations of the invention include wherein the final condition is selected 
firom at least one of the group consisting of: bit error rate metric level and fixed number of 
iterations. 

[0046] The symbol hypothesis testing module can generally be any module that is a 
member selected from at least one of the group consisting Minimum Mean Squared Error 
(MMSE), maximiHn likelihood, M-algorithm, T-algorithm, and Q-algorithm, decorrelating 
decision-feedback detector (iDDFD), improved decorrelating decision-feedback detector 
(EDDFD), successive interference cancellation (SIC), parallel interference cancellation 
(PIC) and multi-stage detector; block-iterative interference cancellation; and a deferred 
decorrelating decision-feedback detector. ^ 
[0047] The apparatus can fiirther comprise an energy detector coUpled to flie joint 
amplitude estimator which can detennine activity on a channel, such as code^ time slot o^ 
firequency chanjiel. 

[0048] The bank of decoders can be selected from at least one decoder of the group 
consisting of: Soft-output Viterbi , Maximum A Posteriori , and BCJR. 
[0049] Another embodiment includes a method for providing initial amplitude estimation 
for a plurality of user chaimels, comprising separating the user channels into active 
channels and inactive channels. Processing the inactive channels is according to the sub- 
steps comprising applying a bank of filters to the inactive channels for each inactive user, 
squaring an output firom the filters, summing an output from the squaring operation, and 
calculating an average bias estimate from an output of the summing. Processing the active 
chaimels is according to the sub-steps comprising appljdng a bank of filters to the xiser data 
of the active chaimels, squaring an output from the bank of filters, summing an output from 
the squaring, and removing the average bias estimate from an output from the summ ing . 
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[0050] The method can include processing the inactive channels by dividing the inactive 
channels into groups and calculating at least one group average bias estimate. It can also 
include processing the active channels by dividing the active channels into groups and 
respectively removing the at least one group average bias estimate. 

[0051] A further variation includes scaling by the summers and may include an energy 
detector for separating the active channel from the inactive chaimels. 
[0052] Another embodiment of the invention is an apparatus for processing initial 
amplitude estimates from a data stream of multiple users, comprising an energy detector 
measuring an energy level of tiie data stream, wherein the energy detector separates a 
plurality of active channels from a plurality of inactive channels. There is a bank of filters 
coupled to the inactive channels, a bank of squaring operators coupled to the bank of 
filters, a bank of inactive simmiers coupled to the squaring operators, and an average bias 
estimator coupled to the inactive summers calculating an average bias estimate. There is 
also a bank of filters coupled to the active channels, a bank of squaring operators coupled 
to the bank of filters, a bank of active summers coupled to the squaring operators, and a 
bias removal section coupled to the active summers and the average bias estimator. 
[0053] Another embodiment includes a joint ^plitude estimator for a data stream froin 
multiple users, comprising a data stream from the multiple users divided into a plurality of 
observation intervals. There are a plurality of processing modules coupled to the 
observation intervals^ vsdierein the processing modules calculates interference cancellation 
values for each of the users and computes a filter for each of the observation intervals. The 
filter being applied to the data within the observation interval is used to compute individual 
amplitude estunates. There is an amphtude estimation unit that processes the individual 
ampUtude estimates and calculates new amplitude estimates, wherein the new amplitude 
estimates are iteratively passed back to the processing moduleis imtil a final condition is 
obtained. 

[0054] One aspect includes where the amplitude estimation imit sums and weighs the 
individual estimates such that the new amplitude estimate is a weighted average of 
individual amplitude estimates. 
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[0055] It should be understood that the observation intervals may be based on 
distinguishing attributes selected from at least one of the group consisting of: time, code, 
and frequency. 

[0056] The filter may be calculated for an arbitrary observation interval m by the 
equation: 

[0057] Yet a further embodiment includes a method for processing amplitude estimates for 

a multiuser data stream divided into a plurality of observation intervals, comprising 

computing a filter for each observation interval of the data stream, applying interference 

cancellation to the data stream for each observation interval, applying the filter to the data 

stream for each observation interval from the interference cancellation to produce 

individual amplitude estimates for each observation interval, computing nev^ amplitude 

estimates using the individual amplitude estimates; and passing the new amplitude 

. - ft 

estimates back to the filter for iterative processing imtil a final condition is obtained. 
[0058] One embodiment is a turbo-decoding system for joint signal demodiilation, 
comprising a data stream from a plurality of users divided into a plurality of observation 
intervals. There is a plurality of symbol processing nodes processing the observation 
intervals to compute symbol estimates for the data stream within the observation interval. 
A plurality of decoder nodes is present for processing the symbol estimates and producing 
a plurality of symbol likelihoods. There is a plurality of amplitude update nodes 
processing the symbol likelihoods and calculating a plurality of amplitude update vectors. 
And, there is an ainplitude estimator node processing the amplitude update vectors and 
producing an amplitude estimate update, wherein the ampUtude estimate update is passed 
back to the processing nodes for iterative processing between the symbol processing nodes, 
the decoder nodes, the amplitude update nodes, and the amplitude estimator node until a 
final condition is obtained. 

[0059] The turbo-decoding system can encompass, a plurality of memory units, wherein at 
least one memory imit is coupled to each of the processing nodes and to each of the 
amplitude estimation nodes. There can also be a first system area network (SAN) coupling 



15 



wo 2004/095713 



•CT/US2003/038190 



the processing nodes and the decoder nodes, a second system area network (SAN) 
coupling the decoder nodes and the amplitude estimation nodes, a third system area 
network coupling the amplitude estimation nodes and the amplitude estimator, and a fourth 
system area network coupling the amplitude estimator and the processing nodes. The 
processing nodes, the decoder nodes and the amplitude estimation nodes can even be one 
set of nodes. A set of nodes can also be comprised of at least one processor. 

[0060] A further embodiment of the system includes wherein the likelihood symbols 
from the decoders are accessible at any time for post-processing. The processing nodes 
can further comprise a thresholder to convert the symbol estimates to symbol bits. And, 
the bank of decoder can also perform de-interleaving, rate identification, de-puncturing and 
de-scrambling. 

[0061] The features and advantages described herein are not all-inclusive and, in 

particular, many additional features and advantages will be apparent to one of ordinary 

skill in the art in view of the drawings, specification, and claims. Moreover, it should be 

noted .that the language used in the specification has been principally selected for 

• ft - * ^ , ' ' ' . ■ ' .* 

readability and instructional purposes, and not to limit the scope of the inventive subject 

matter. * 

BRffiF DESCRIPTION OF THE DRAWINGS 

[0062] Figure 1 illustrates a basic diagrammatic processing perspective for the prior art 
transmitter end and receiver end with iterative multiuser detection processing; 
[0063] Figure 2 is a block diagram perspective showing joint symbol amplitude and data 
rate estimator for one embodiment of the invention; 

[0064] Figure 3 is a schematic block diagram showing initial amplitude esthnation; 
[0065] Figure 4 is a schematic block diagram showing joint amplitude estimator; 
[0066] Figure 5 is a schematic block diagram showmg parallelization of joint symbol 
amplitude and data rate estimator; and 

[0067] Figure 6 is a schematic block diagram showing hardware conceptualization 
according to one embodiment of the present invention with nodes used to . depict the 
processing. 
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DETAILED DESCRIPTION OF THE INVENTION 

[0068] The methods and embodiments of the jomt symbol, amplitude and rate estimator 
disclosed herein enable implementations of advanced receiver processing providing high 
quality real-time processing for multiple access systems, including overloaded situations. 
One embodiment illustrates the signal processing technique applicable to many variations 
that are all within the scope of the invention. It should be xmderstood that the reference to 
a data stream of multiple users is understood to represent digital data from a number of 
references and not necessarily multiple users in commimications. 

[0069] In order to fully appreciate the processing of the present invention, it is usefiil to 
consider a description of the processing known in tiie art. Referring to Figure 1, prior art 
transmitter section 5 and receiver section 10 is depicted with MUD processing. There are 
K users with data di - dk as the input to the channel encoders 20 where the data is 
convolutionally encoded at a code rate Rk. The interleaver 25 performs the interleaving of 
the data, which is then symbol mapped by the symbol mapper 30, such as BPSK symbol 
mapped with data symbols having a duration T. Each data symbol is then modulated by a 
spreading waveform from the spreader 35, and the signal data Sk(t) is output from the 
transmitter end with a number of data symbols per user per frame. In the typical wireless 
commxmication application presented herein, a number of users (1-K) generate signals that 
are transmitted by antennas 40 into free space. The transmission on hard-wire or fixed wire 
systems is also known in the art, however the present example is directed towards the 
wireless communications systems. Noise n(t), such as white Gaussian noise, is an inherent 
component of most transmission signals and some random noise components are generally 
present in transmitted data signals r(t). 

[0070] On the receiving end 10, antenna(s) 42 receive the transmitted signals r(t) as well 
as the various interfering signals, such as n(t). There is normally a noise component n(t) 
that is introduced from the environment of a random nature in the received signal. While 
any noise that has a repeatable or non-random nature can be eliminated through processing, 
random noise elements are reduced in other manners such as filtering but are inherent 
conditions. The various signals are received at antennas 42 wherein there is typically one 
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signal for each polarization feed. The received signals represent directly received signals, 
as well as multi-path signals from the same user, and interfering signals j&om other users. 

[0071] The input signals of raw non-manipulated data at the receiver 10 is comprised of 
the aggregate of many signals from many different transmitters/users, where each signal is 
generally assigned a chaimel (frequency, timeslot, and/or spreading code) from a finite set 
of channels. The interference from these various users generally requires complex 
processing and associated processing time. The input data represents a vector of data, 
transferred at some rate (e.g., the symbol rate), and this data is typically transmitted to a 
matched filter (not shown). 

[0072] The plurality of signals from each antenna 42 is processed in a RF front end xmit 
44. The front end unit 44 generally downconverts the higher frequency signals into 
baseband signals and provide processing and filtering as is known in the art. The baseband 
signals are also digitized by analog to digital converters (ADC). The front end 44 
cooperates with the parameter estimation unit 46 to retrieve needed information for the 
si^ials such as relative received timing offsets, ciarrier phase, frequency ojffsets, received 
amplitudes, and multipath structure for each of the interfering signals present in the 
received signal. 

[0073] A parameter estimation unit 46 is coupled to the front end 44 and processes the 
various parameters for the received vector data. A parameter estimator 46, as known in the 
art, is a broad description for a unit that provides information to the MUD 50 such as 
convolutional code, . signatures, and multiplexing format. While the term parameter 
estimator is used herein, the term is intended to be interpreted in the broader sense as 
known in the joint demodxilation field. The parameter estimation module 46 generally tries 
to estimate timing, signal amplitudes, phases, polarization, and identification of 
transmission chaimels. 

[0074] The MUD element 50 consists of fimctional blocks that process the digital data 
and extract the user symbol streams. The MUD 50 performs pre-processing and converts 
the baseband digital data into the proper format for further processing according to the 
detection scheme. The format is often one measvurement per 'dimension' per symbol. The 
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MUD unit 50 outputs a bit (or symbol) stream associated with each interfering signals 
present on the channel for one data block. 

[0075] Turbo MUD processing assumes knowledge of various parameters such as 
relative received timing offsets, carrier phase, frequency offsets, received amplitudes, and 
multipath structure for each of the interfering signals present in the deceived signal. This 
input data represents raw data after some front end processing such as downconversion, 
amplification, and analog-to-digital conversion. MUD 50 needs some raw data parameters 
in order to establish accurate decision trees for processing. 

[0076] There is an iterative processing topology with a MUD section 50 that interacts 
with the K number of chaimel decoders 65 for the received signal r(t) for each user. The 
multiuser detector SO is generally a SISO detector, receiving and outputting soft values, 
although hard value processing is known in the art. The MUD section 50 utilizes a priori 
information and delivers values such as a posteriori log-likelihood ratio (LLR) data of a 
transmitted *+l ' or '-1* for each bit of every user. Certain a priori information is computed 
by the channel decoder 65 from a previous iteration which is interleaved and fed back to 
the MUD section 50. The int^leav^. 60 and deinterleaver 75 perform the interleaving and 
deinterleaving frinctions respectfully for the encoding schema. Likewise, the summers 55, 
70 perform the summations for the data. Once the processing is complete the channel 
decoders 65 produce the output data stream (dk)' representing the best estimate of the 
transmitted signal (dk). 

[0077] Various schemes for TurboMUD processing are well known in the art and utilize 
a priori information about the received signals wherein the processing continues through a 
number of iterations until certain conditional parameters are satisfied. The basic Turbo- 
Mud procedure is presented in published literature such as Poor, "Turbo Multiuser 
Detection: An overview," IEEE 6* Int. Symp. On Spread-Spectrum Tech. And Appli., 
NJIT, New Jersey, Sept. 6-8, 2000 and Alexander, Reed, Asenstorfer, and Schlegel, 
"Iterative Multiuser Interference Reduction: Turbo CDMA," IEEE Trans. On Comms., 
v41, n7, July 1999. The iterative MUD algorithm such as representative of the approaches 
used to incorporate turbo decoding methods into joint MUD/FEC (Fourier Error 
Correction) decoding and to then reduce the complexity of the system are known in tihie art. 
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[0078] The bit streams firom the MUD 50 are passed to a bank of error correction 
decoders unit 65. The decoders 65 calculate conditional probabilities, one for each 
decoded symbol of each user, and output them as confidence values back to the MUD 50. 
In one embodiment, there is a bank of error correction decoders 65 such as Viterbi 
decoders or MAP. 

[0079] Soft outputs for each bit of each user from the bank of decoders 65 is fed back to 
the multiuser detector 50 for each iteration, one stream of outputs for each interfering user 
present in the received signal. These soft outputs are passed back to the MUD SO to 
produce an improved stream of soft bit (or symbol) decisions that fed into the decoder 65 
for another iteration of improvement. The information between the MUD 50 and the 
decoders 65 repeats in subsequent iterations xmtil an asymptote is reached or the desired 
performance level is attained. At that point, estimates of the data sequences for all active 
users are output. Operation then commences for the next block of data, repeating the 
process described above. The multiuser detector 50 takes these soft inputs along with the 
original raw input signal to calculate an improved, less corrupted bit stream for each user! 
This iterative process continues until a desired metric is reached or^ a fixed number is 
reached. At that point, estimates of the data^ sequences for all active users are ou^ut 
Operation then commences for the next block of data, repeating the process described 
above. 

[0080] The number of iterations for processing between the MUD 50 and the decoders 
65 can be set to a fixed counter or by checking if there were significant changes to the data 
from the last iteration. Once the data is no longer being altered or reaches a certain 
iteration counter limit, the data from the decoder 65 can be output as final estimates of 
what the user sent, (dk)'. A fixed number of iterations can be stored and used and 
proceissed by the decision block. Alternatively, the information between the mxiltiuser 
detector 50 and the decoders 65 repeats in subsequent iterations imtil an asymptote is 
reached or the desired performance level is attained. A buffer can store the previous values 
and compare them to the latter processed values during the iterative process. 

[0081] MMSE detectors with prior information are usually described in a turbo or 
iterative application, but also ftmction in non-iterative operations. As described herein, a 
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complete solution to the problem is computed, and one approximate (low complexity) 
method is described. Furthermore, prior schemes are generally inferior for overloaded 
systems, and this is alleviated by the present invention. 

[0082] The apparatus in Figure 2 is now described in more detail. A Data Stream 100, 
potentially complex, is received from some source. For the case of Code Division 
Multiple Access (CDMA) communications schemes, the data stream is sampled by some 
multiple of the chip rate. For TDMA communication schemes, the data stream is sampled 
at some multiple of the symbol rate. The received signal can effectively be modeled as the 
linear combination of many co-channel signals, mathematically illustrated by 

F K 

. = ES** W^A"^* W - 'Ti) + n^inT,) . Equation 1 

[0083] The term represents the symbol for user k at time index /. The term 

* ■ *■ * 

5jfc(«2^ — iTJ) represents the user's signal characteristic for sample index n for the symbol 

period i for the user k. The characteristic waveform, ^j^(-), is normalized such that the 

^amplitude of the transmitted signal for user ^ is represented by . The symbol period is 

defined by 7] and the sample period is represented by T„ . The linear model represents a 

summation of K separate users and over F symbols. The received signal can represent a 
collection from multiple sources. For example, the collection may consist of multiple 
antennas, not necessarily co-located, with potentially different polarizations. 

[0084] The user's signal characteristic, 5'^(«7), ~z7]) , combines a sequence of signal 

transformations: pulse shaping filter, signal delays, receiver filters, code channels (for 
CDMA) and multipath. It should be understood that the characteristic waveform is not 
restricted to a sample or symbol interval. 

[0085] The niultiple access interference model in Equation 1 is represented concisely 
using matrix notation in Equation 2. The symbols, corresponding to 
the isCF simultaneous users, is represented by the iCF -element vector, b[z] at time i. A 
linear model is used to represent the received signal at time index i by 
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r[/] = SAb[/]+ii4/], 



Equation 2 



where h[i] represents the AIF symbols at time index /. The term S is a N>iKF matrix 

representing the combination of the spreading code, cliaimel codes, piilse shaping filter, 
and propagation effects 



S = 



s^[N,i] s^[N,i] ... s^[N,i] 



Equation 3 



[0086] The matrix entries, [«,/] , represents the n* sample of the signal characteristic 

^veform for user k at time i . The term A is a ^xiCF diagonal matrix that represents 
the complex signal amplitudes, which can include phase and polarization weighting. The 
term ii^[z] isaJVxl vector that jepresents additive wMte Gaussian noise (AWGN).^ 

[0057] Referring again to Figure 2, the data stream 100 represents a vector of data, 
representing the received signal. This data is communicated to the symbol hypothesis 
testing module 105 which is indicated in this implementation as Minimum Mean Squared 
Error (MMSE) based symbol estimator. The MMSE implementation for symbol hypothesis 
testing 105 is better for computational efficiency however olher forms of symbol 
estimators such as the maximum likelihood detector are within the scope of the invention. 
The maximum likelihood detector is a brute force approach which performs an exhaustive 
evaluation of the Euclidean distances between the received samples and the linear model of 
the samples using every possible hypothesis of the bit sequence. Instead of the optimal 
detector for the symbol estimation testing module 105, more computationally efficient 
forms of symbol hypothesis testing includes, the M-algorithm, T-algorithm, etc. Although 
the most eflScient approach appears to be the MMSE-based symbol estimator, another 
efficient implementation for symbol estimation is referred to as the Q-algorithm, disclosed 
in U.S. Patent application Serial No. 10/105,918, filed March 25, 2002 entitied "System for 
Decreasing Processing Time in an Iterative Multi-User Detector System". The Q- 
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algorithm used for the MMSE-symbol estimator cancels interference based on symbol 
estimates obtained from thresholding the symbol likelihoods. 

[0088] The computational efficiency utilizing the MMSE based symbol estimation is 
described in further details for the MMSE-based joint amplitude estimation and the 
parallelization of the algorithm. It should be noted that the symbol hypothesis testing is 
performed following completion of the initial amplitude estimate. 

[0089] The same data vector 100 is passed to the initial amplitude estimation module 
110. As indicated by the name, the purpose of initial amplitude estimation module 110 is 
to provide an initial estimate of the amplitude for each interfering signal. The approach 
implemented in initial amplitude estimation module 110 consists of incoherently 
integrating the energy out of a properly normalized matched filter for all users determined 
to be active. Note these users can be transmitted over code, temporal, and/or frequency 
channels and this estimation procedure are appropriate for these modes of digital 
communication. Part of the frmction performed in the initial amplitude estimation module 
110, is the identification of which users are active. User identification can be easily 
obtained via an. energy detector based on some sufficiently long observation interval that 
can determine activity on a channel: code, time slot, or frequency channel. The energy 
detector is generally integrated within the ifdtial amplitude estimation modtile 110. 
[0090] The initial amplitudes from the estimator 110 are passed to the MMSE-based 
symbol estimator 105 and the MMSE-based Joint Amplitude Estimator 120. The MMSE- 
based symbol estimator 105 uses the initial amplitude estimate only on the first iteration, 
for all other iterations it uses the amplitude estimates from the MMSE-based Joint 
Amplitude Estimator 120. This module may implement any type of symbol hypothesis 
testing procedure however an MMSE-based approach is used because of computational 
efficiency. It should be imderstood that other linear detectors, such as the decorreator, may 
also be used for computational efficiency. While the linear detectors are efficient, the 
performance is sacrificed through BER performance and loading capacity. 

[0091] There are numerous non-linear symbol-hypothesis testing procedures adaptable to 
the present invention, including the M-algorithm/T-algorithm; maximum likelihood; 
decorrelating decision-feedback detector (DDFD) and improved decorrelating decision- 
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feedback detector (EDDFD); successive interference cancellation (SIC), parallel 
interference cancellation (PIC) or multi-stage detector; block-iterative interference 
cancellation. A further symbol-hypothesis testing implementation is described in the 
commonly owned application entitled '"Deferred Decorrelating Decision-Feedback 
Detector for Supersaturated Commtmications", U.S. Patent Application Serial No. 
10/423655, filed April 25, 2003. 

[0092] The MMSE-based symbol estimator 105 provides symbols estimates to a bank of 
single user decoders 130. The symbols estimates from the symbol estimator 105 are routed 
to the decoders 130 based on user niunber. For example, all sjonbol likelihoods 
corresponding to the first user are passed to the decoder dedicated to the first user. This 
mapping is repeated for all users. The single user decoders consist of a bank of K 
decoders that may consist of Soft-output Viterbi or Maximum A Posteriori based 
techniques such as BCJR. Sofl-input, soft-output (SISO) decoders are used in one 
embodiment for this implementation, but hard values can be processed as is kno\vn in the 
art. . / 

[0093] Inh^entin most processing is a de-interleaving, rate identification, de-pimcturing 
and de-scrambling processing depicted iherely 'as deinterleaving 125 for illustrative* 
pxuposesv The symbol estimates and their respective likelihoods are output from the bank 
of decoders 130 for one of the most-likely data rates. In addition, symbol likelihood 
estimates firom the decoders 130 are also fed back around to both the MMSE-based symbol 
estimator 105 and MMSE-based amplitude estimator 120 respectively. However, prior to 
feedback the symbol likelihoods are processed by an interleaving section 135 performing 
re-interleaving, re-puncturing, re-scrambling and symbol repeating (for low rate firames). 
The purpose of this re-coding task is to properly emulate the interference. 
[0094] As referenced herein, the term *data rate' refers to the rate at which the 
information bits are transmitted over the channel, defined in terms of bits per second. 
Sometime the data can be transmitted at one of a number of different rates. The 
likelihoods of the bits are maximized when the data rate is known. However, if the 
receiver is non-cooperative, the 'most likely' data rate must be identified. One approach 
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for determining the most likely data rate is through joint maximization of the bit 
likelihoods and data rate. 

[0095] The new symbol likelihoods from the decoders 130 are used to update previous 
amplitude estimates in the joint amplitude estimator 120. The procedure for processing the 
MMSE-based amplitude estimator is described in further detail herein. The new amplitude 
estimates from the joint amplitude estimator 120 are passed to the MMSE-based Symbol 
Estimator 105 that re-estimates the symbols based on prior information on the symbols and 
the updated parameter estimates. These updated symbol estimates from the symbol 
estimator 105 are then re-routed to the bank of decoders 130. This process is repeated for 
either some fixed number of iterations or until the estimates of the bit error rate or frame 
error rate achieve some desired level. 

[0096] Figure 3 illustrates the system and processing for calculating initial amplitude 
estiinates with the Initial Amplitude Estimator 110 of Figure 2. Referring to Figure 2 and 
Figure 3, the initial amplitude estimate obtained from the Initial Amplitude Esthnator 110 
is based on tiie incoherent integration of the matched filter output of the Active Channel 
180 over some observation interval. The procedure consists of applying a bank of matched 
^filters 20S,'230> 255 for all users K and all symbols Fto the received data vector 100. 
The KF X 1 vector representing the matched filter outputs is represented by 



[0097] The estimate of the amplitude for a user k is based on summing the squared 
outputs of the matched filter for that user k, summed over an observation interval of F 
symbols. An estimate of the amplitude squared is represented by 



[0098] The summation over F is performed for all K users to obtain K initial 
amplitude estimates for the K users. Since the estimate is based on incoherent integration. 



y = S^r = S'^SAb + S'^n^ . 



Equation 4 



Equation 5 
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the estimate is biased which is illustrated by the expected value of the initial amplitude 
estimate for an arbitrary users k, 

E[al] = al +^j:tpl W Equation 6 

where the bias is represented by the last term in Equation 6 and pjj^ (/) = (s" (z)sy {i)f , 

Note that the bias in Equation 6 is based on the expected values of the bits in the vector b, 
in Equation 4, equaling zero. This is mathematically represented by 

2 

In addition. Equation 6 assumes the symbols are defmed such that b. = 1. A generalizatiqn 

o^Equation 6 for the case of non-zero mean bit values and b.^^ .1 is obvious. 

[0099] An inactive user 190 is defined, as a signatare signal that is not present. The 
expected value of the amplitude estimate for an arbitrary inactive channel is defined as 

E[aL.^]=^ttp^j^{iyj Equation? 

where Py.&«,crivc(0~(*i^criw(0^y (O)^ • Since no signal is present for inactive users, a 
nonzero value for tiie expected value of the estimate for amplitude squared for inactive 
signature vectorjs is due to the presence of other active channels. Comparing Equations 6 
and 7 reveals that examination of the estimate of squared amplitudes for non-active users 
can be used to estimate the bias relative to active users. Thus the Initial Amplitude 
Estimation concept is based on Equations 5 through 7 is illustrated in Figure 3. 
[00100] A data stream 200 representing the received signal, potentially complex, is 
received from spme source. A bank of filters 205, 230, 255, 280 and 300 is applied to the 
data stream 100. The Active Channel 180 set of filters 205, 230, 255 consists of filters 
matched to users that have been determined to be active based on an energy detector. 
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Filter 205 corresponds to a filter matched to the signature signal for user 1. Similarly, 
filter 230 corresponds to a filter matched to the signature signal for user 2. It should be 
noted that the size of the bank of matched filters is based on the number of users 
determined to be active and represented by the term K . 

[00101] The output data firom each matched filter (1-K) 205, 230, 255 is then processed 
by a squaring operation 210, 235, 260 for 'all K users. The outputs from the squaring 
operatio3as 210, 235, 260 for each active channel user (1-^ are then passed to a bank of 
summers 215, 240, 265 which performs the incoherent integration. The bank of summer 
215, 240, 265 for each user l-J^used to implement Equation 5, accumulates energy in each 
of their respective processing chains. The proper scaling required for averaging is included 
in the output of the simuners. As previously mentioned these estimates of squared 
amplitude are biased, therefore, the bias removal component must be implemented to 
minimize the bias. 

[00102] A set of filters 280, 300 for the Liactive Channel 190 representipg those channels 
that were determined to be inactive based on an energy detector. The bank of filters 280, 
30.0 matched to thp inactive user's signature sigrials is applied to tiie data stream 200 and 
processed. The size of the bank of filters 280, 300 is based on the number of users 
determined to be inactive, defined herein as the mmiber J. 

[00103] Following the processing of the inactive filters 280, 300 for the users 1-J, the user 
data is fiirttier processed by a squaring operation 285, 305 for all J users. The outputs from 
the squaring operations 285, 305 for each inactive channel user (1-J) are then passed to a 
bank of summers 290, 310. The bank of summer 290, 310 for each user 1-J accumulates 
energy in each of their respective processing chains. The proper scaling required for 
averaging is included in the output of the sunraiers 290, 310, wherein the output is then 
processed by the Average Bias Estimator 295. 

[00104] Following the computation of the amplitude bias by the Average Bias Estimator 
295, the bias is passed to Bias Removal 220, 245, 270 that is applied to each of the Active 
Channel Users l-K. The illustration in Figure 3 shows the output from the active channel 
summers 215, 240, 265 going to all bias removal modules 220, 245, 270. The initial 
amplitude squared estimate obtained for the active chaimels has an estimate of the bias 
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removed in modules 220, 245, 270. Following the bias removal, the initial amplitude 
squared estimates 225, 250, 275 for each user l-K are passed out to the remaining 
processing shovioi in Figure 2. 

[00105] Figure 3 shows the computation of one amplitude bias in modxile 295, which is 
then applied to all bias-removal modules 220, 245, and 270, for all active channels. The 
processing configuration of Figure 3 is also extendable to other embodiments in which 
different biases are computed for different sets of active channels, which are obvious 
extensions and within the scope of the present invention. For example, in a wireless 
communications system the bias removal could be restricted to chaimels within certain 
sectors. More specifically, for all non-active channels identified in a sector, a bias estimate 
is computed and then applied to all active channels in the same sector. Therefore, active 
channels identified in a different sector will only use a bias based on non-active channels 
in this same sector. 

[00106] Referring to Figure 2, the initial amplitude squared estimates are updated in the 
MMSE-based Joint Amplitude Estimator 120. The apparatus fpr updating these amplitude 
squared estimates is described in Figure 4. The solution described in Fi^re 4 is based on 
the assumption that the amplitude is nearly constant over some observation ^Aondow. Since 
this solution is based on the MMSE criteria, this approach is easily extended to time- 
varying amplitudes, however, only the windowed based estimator is detailed herein for 
illustrative purposes. For example, when the amplitude fluctuates by rate, the amplitudes 
over multiple frames are linked based upon the data rate. Incorporating data rate into the 
amplitude estunator, the module described in Figure 4 actually evolves into an energy-per- 
bit estimator. When the amplitude fluctuates based on temporal instability, then this 
implementation is extended into a Kalman filter framework by using a temporal model of 
the channel variation. The Kalman filter equation itself is well known to those in the art. 
[00107] Referring to Figure 2 and Figure 4, the stream of data representing the received 
signal 100 is passed to the MMSE-based Joint Amplitude Estimator 120. This data stream 
100 is divided up into M observation intervals as illustrated by , , through 400, 
405, 410. These intervals can be based on distinguishing attributes such as time, code, or 
frequency. For the example presented herein, the observation interval refers to time. 
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These observation intervals each enter a different processing section 415, 420, 430, 
representing 1 to M observation intervals. In each processing module, 415, 420, 430, an 
MMSE filter is computed which is defined for an arbitrary observation interval m as: 



G„^B„Sl[S„x„Sl^all)'\ 



Equation 8 



[00108] The teraiB^ refers to a diagonal matrix representing the expected bit or symbol 
value in interval m for all K users. For example, 



fii 0 

0 4 



0 0 



0 



Equation 9 



where b,^ represents the e}q>ected value of the symbol for user k . For illustrative purposes, 

for BPSKiSystems, = 2/?^^ -1 , where the probability that the symbol equals 1. 

The term S„ , is similar to that presented in Equation 2, except here S„ represents the "S- 
matrix" defined over a single observation interval, v^rhile S in Equation 2 is defined over 
F observation intervals. Therefore, S in Equation 2 contains the S„ defined herein. The 
term Xm represents the amplitude uncertainty and is rewritten as: 



;ir„=(a+A^)A„ 



Equation 10 



which more clearly defined the amplitude uncertainty in terms of the bit uncertainty, A„ , 
the initial amplitude uncertainty, Q, and the expected value of the amplitude A. The 
terms, A^, Q, and A are all diagonal matrices, and each element along the diagonal 

corresponding to a different active user, therefore, Xm ^ diagonal matrix. This 

formulation assumes the amplitudes and corresponding uncertainties are constant for the 
M observation intervals. 
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[00109] Following the computation of M different MMSE filters in each of the 1-M 
modules 415, 420, 430, each MMSE filter is applied to components of the respective data 
stream 400, 405, 410 corresponding to the same observation interval. 
[00110] Prior to application of the MMSE filter, interference cancellation is performed 
based on the expected value of the bits, represented by and prior estimates of the 

amplitude for all K users, represented by the Kx\ vector a'"^ The interference 
cancellation is represented by 

^ ^r^^S^Bj-' . Equation 11 

And, each MMSE processor cancels out different interference. 

[00111] Therefore, application of the MMSE filter following interference cancellation is 
represented by 

: ", Aa,=G„(7;„-S„B„a'-') Equation 12. 

[00112] The amplitude estimator is defined in module 435 of Figure 4. It is an iterative 
fomiulation that updates previous estimates, represented by a'"^ , with information based on 
new estimates of the users' bits and data rate. These new estimates are represented by a' . 
The MMSE-based joint amplitude estimator 435 combines the individual estimates, A£„, 
in a weighted manner. The weigjhted combination in this embodiment is expressed by: 

Q^'" + fi''" jZ^a. Equation 13 

where 

M 

Q = ^Q„ Equations 

iR-l 
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[00113] As indicated by Equation 13 and illustrated in Figure 4, the individual estimates 
from the filter modules 415, 420, 430 are passed to the Amplitude Estimator 435 for each 
interval 1-M. These individual estimates, A£„ , are ^xl vectors which are then summed 
together in Amplitude Estimator 435. Theses estimates are then weighted 
by(Q''^ {Q + iy^ Q^'^) where Q*^^ is a diagonal matrix containing the standard deviation 

of the amplitude estimates and Q is represented in Equation 14. The term Q is a 
summation of quadratics computed in tiie filter modules 415, 420, 430. Each quadratic is 
computed by 

a=<5AB« Equation 15 

which is shown to be based on the MMSE filter for the respective observation region 
defined in interval m . Therefore each filter module 415, 420, 430 responsible for an 
observation interval must compute: the MMSE filter (see Equation 8); apply interference 
cancellation to the section of the data stream corresponding to the observation interval (see 
Equation 11); apply the MMSE filter to the interference mitigated data vector (see 
Equation 12); and computed the quadratic (see Equation 15). 

[00114] In this embodiment the outputs of each module 415, 420, 430 is a Kxl yector 
Ag^ and a hermitian matrix Q„ which is a iTxJC matrix with K{K'{-l)/2 unique 
elements. These two outputs are passed to the Amplitude Estimiator 435 for each interval 1- 
M. The prior amplitude estimates, a'"^ are updated to a' and passed out from the 
Amplitude Estimator 435 back to the individual filter modules 415, 420, 430 defined for 
the observation intervals. This feedback is part of the iterative implementation inherent in 
this MMSE formulation. 

[00115] Referring again to Figure 2, it should be understood that the amplitude estimates 
are not updated until the bits (or symbols) have been updated in the bank of decoders 130. 
Following the processuig by the MMSE-based symbol estimator 105 and the decoders 130 
in Figure 2, the symbols are fed back to the MMSE-based joint amplitude estimator 120 
where prior amplitude estimates are updated as detailed herein. As previously described. 
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the number of iterations is typically dictated by the user which is defined for either a fixed 
iteration number or until the frame error rate or estimate bit error rate metric reaches some 
desired level. 

[00116] Once processing is completed, the updated amplitude estimate vector 440 is 
transmitted from the MMSE-based Joint Amplitude Estimator 120 to the MMSE-based 
symbol estimator 105. Since for this particular implementation, the amplitudes are 
constant over the observation interval described for M smaller observation intervals, the 
same amplitude estimate 440 is reported to each MMSE-based symbol estimator 105, This 
approach is easily modified for a variable amplitude estimate which varies over the entire 
observation interval. 

[00117] The apparatus in Figure 5 is one embodiment of a hardware implementation that 
provides a more detailed characterization of the processing shown in Figure 2. It should 
be noted that Figure 5 shows an implementation that parallels the joint estimation based 
on M observation intervals. The data stream in 100 of Figure 2 is partitioned into 
M observation intervals as the data vectors 500, 502, 504, 506 representing intervals 1 - 
M As in previous diagrams, the nuniber of observation intervals is generalized 
to Af intervals. The observation intervals may overlap (not necessarilj^ disjoint) to account 
for asynchronous reception and muWpath signals. 

[00118] These data vectors 500, 502, 504, 506 are processed immediately by a bank of 
MMSE filters which are implemented on the M processors, 520, 522, 524, 526, for all 
M observation intervals and each processor simultaneously estimates symbols for all 
K users. The filter processing can be considered a node, wherein a node is a generic 
processing component as is known in the art, such as a microprocessor chip or processing 
board, and wherein it should be readily apparent that liiere can be more than one processor 
per node. 

[00119] In addition, these data vectors 500, 502, 504, 506 are read into local memory 510, 
512, 514, 516 on the processors, because they are used repeatedly for the multiple 
iterations in this iterative approach to symbol, amplitude, and rate estimation. Any of the 
forms of memory known in the art are applicable. 
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[00120] Each symbol estimation MMSE processor, 520, 522, 524, 526 outputs , which 
is a vector of K symbol likelihoods or symbol bits for observation interval m . Recall that 
K corresponds to the number of users and therefore bm represents the collection of symbol 
estimates for K users in observation interval m. Symbol bits are provided on the output if 
thresholding is performed within the MMSE processor 520, 522, 524, 526. The output 
from each MMSE processor, 520, 522, 524, 526, is intended to represent M Kx\ vectors 
passed to the system area network (SAN) 530 which will then pass K Mx\ frames of 
likelihoods (or symbols) to a bank of decoders 532, 534, 536, 538 for each user 1-K. The 
benefit of supplying likelihoods to the decoders 532, 534, 536, 538 is the improved 
performance when using soft-decoders. It should be noted that the same processing nodes 
used for implementing the bank of MMSE filters for symbol estimation for processors 520, 
522, 524, 526 can be used to implement the A* decoders, 532, 534, 536, 538 which are 
partitioned between the compute nodes. However, this embodiment also covers an 
implementation for hardware dedicated to MMSE processing and separate, hardware for 
decoding. , . 

[00121] The following discussion describes one embodiment for an implementation when 
a bank of processing nodes is used for both MMSE processing and decodmg. Symbol 
likelihoods are routed over tiie SAN 530 based on the assignment of decoders 532, 534, 
536, 538 to processing components for decoding. The SAN 530 routes the M ^ x 1 vectors 
of symbol likelihoods from the symbol estimation processing previously used to perform 
MMSE-based symbol estimation on processors 520, 522, 524, 526 to the same set of 
processing nodes that now perform the decoder processing on the bank of decoders 532, 
534, 536, 538. The SAN 530 passes AT frames of Mxl symbol estimates to the 
jRT decoders 532, 534, 536, 538, wherein the frames of symbols are represented by for 
the iS: users. The frames,^, represent a frame for each user that contains M symbols. 
[00122] The number of decoders 532, 534, 536, 538 corresponds to the number of users, 
1-^. The bank of decoders 532, 534, 536, 538 may implement hard-in, soft-out (HISO) 
or soft-in, soft-out (SISO) decoders. In addition to decoding, the bank of decoders 532, 
534, 536, 538 performs descrambling, de-interleaving, de-puncturing, and rate 
identification. The output of each decoder 532, 534, 536, 538 is,/^', which is 
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aMxl vector representing an update of M likelihoods in the frame following the 
implementation of the decoders. In addition, the decoders 532, 534, 536, 538 supply 
likelihoods of the information bits values dk for the same collection of symbols. These 
information bits are passed out as output 540, 542, 544, and 546. The number of bits per 
user is dependent on the data rate, and there bit likelihoods are available at any time in the 
iterative scheme for post-processing such as a cyclic reduction check, which is xised to 
determine frame errors. The number of information bits transmitted per second is less than 
the number of symbols per second. Therefore, the number of information bits will be less 
than My which defines the nimiber of symbols per frame. 

[00123] The updated likelihoods of the symbols K Mx\ vectors of likelihoods, ^ are 

fed-back through SAN 550 and processed by a bank of MMSE filters which are 
implemented on the M processor nodes, 560, 562, 564, 566. The ^Mxl vectors of 
likelihoods, \ prior to the SAN 550 are sent to the MMSE filters as M jSTxI vectors of 

likelihoods, ^ However, now the bank of MMSE filters is used for ampUtude estimation 




[00124] The same bank of 1 - iW symbol estimation processors 520, 522, 524, 526 used to 
implement the initial MMSE filters for symbol estimation can be used for the ftirther 
MMSE filter iniplementation for amplitude estimation, illustrated by processing modules 
560, 562, 564, 566. The MMSE processing niodules 560, 562, 564, 566 in amplitude 
updating focus on the same observation intervals an^alyzed during the symbol estimation. 
The MMSE processors 560, 562, 564, 566 at this stage implement the functionality of the 
filter module 415, 420, 430 of Figure 4. This bank of MMSE filters implemented by 
MMSE processors 560, 562, $64, 566 draw from local memory 552, 554, 556, 558 to 
retrieve prior amplitude estimates, updated symbol estimates, S-matrices, and the received 
data vector corresponding to the observation interval m . 

[00125] Referring back to Figure 4, each amphtude processing module 415, 420, 430 
computes an amplitude update vector, A^, which is Ky.\ vectors. Each amplitude 

update vector is passed to a central node that implements the functionality of the 
Amplitude Estimator 435. In Figure 5, all individual update vectors from the amplitude 
estimation processors 560, 562, 564, 566 are passed to a central processing node 575 via 
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the SAN 570. As detailed herein, the amplitude estimate is updated by weighting the sum 
of all amplitude updating vectors, wherein the weightmg is based on quadratic terms 
reflectmg the uncertainty in the update, . These quadratic terms are also passed to the 
central node 575 to provide the necessary weighting. 

[00126] The SAN 570 passes all Aa„, vectors ( M vectors of size A: x 1 ) and matrices, 
(Af hermitian KxK matrices with K{K'^l)/2 unique elements) from the MMSE 
processors 560, 562, 564, 566 to the central processing node, 575. The Amplitude 
Estimator 575 computes the functionality described for the Amplitude Estimator 435 of 
Figure 4 and the updated estimate gets distributed to all Af processors used for the next 
MMSE-based symbol estimation. Therefore, Ainplitude Estimator 575 passes the updated 
amplitude estimate to SAN 580 tiiat fan out these estimates for all JST users to the M 
processors 520, 522, 524, 526 for the symbol estimation processing. 
[00127] The updated amplitude estimates refreshes the amplitude estimates stored in the 
local memory 510, 512, 514, 516 coupled to the MMSE processors 520, 522, 524, 526. 
Following the receipt of the updated amplitude estimates, the next iteration of the MMSE- 
based symbol estimator is implemented/Therefore, the MMSE modules 510, 512, 514, 516^ 
perform the symbol estimation processing and repeat the symbol estimation on the next 
iteration using the new amplitude estimates. As described herein, the iterative processing 
continues until estimates of the bit error rate metric achieves a certain level or until a fibced 
nimiber of iterations have been executed. 

[00128] The hardware configuration presented in Figure 5 is an attractive implementation 
because bandwidth requirements are minimized by passing a total of only iT -M floating 
point numbers over the SAN 530, except during amplitude updating when the matrices 
Q„arQ passed over the SAN 570 to a central node 575. Bandwidth can be further 
minimized during the amplitude testing by approxhnating Q„ as a diagonal and therefore 
pass a total of2K • M floating point numbers during amplitude updating. Smce the MMSE 
operations, both symbol and amplitude, occur at the same processing nodes on processors, 
520, 522, 524, 526 and 552, 554, 556, 558, less data is processed and efiBcient techniques 
can be used to exploit previous MMSE filter weights. 
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[00129] The communications conjBguration described in Figure 5, is more easily 
recognized in Figure 6, which shows 1 - L nodes responsible for all processing. These L 
nodes pass data between themselves using a high speed interconnect 625 for the system 
area network (SAN). The modules 600 correspond to the time when the nodes 1 - L are 
used to implement the bank of Af MMSE filters for symbol estimation in M observation 
intervals. The symbol estimation processing at node 600 is comparable to processing 
performed on MMSE processors 520, 522, 524, 526 shown in Figure 5. 
[00130] The number of L nodes depends upon the size of the problem, typically the 
number of ^users, and the available hardware. For example, if ^ is small, then more than 
one MMSE filter operation can be performed per node in which case L<M. UK is large 
such that more flian one node is reqmred per MMSE filter operation, then I>M. The 
capability of the node is based on processing speed, local memory, design constraints, and 
concept of operation. 

[00131] The data firom symbol estunation processing nodes 600 is passed by the high 
speed mterconnect 625 to the decoder, processing nodes 605. The decoder processing 
nodes 605 are Z nodes used to implement a bank of decoders such as SISO decoders, 
wherein the decoders for K users are distributed among the L nodes. This decoder 
ijrocessing corresponds to the bank of decoders 532, 534, 536, 538 of Figure 5. While 
Figure 6 illustrates an implementation using common hardware, for the unplementation 
where the hardware for MMSE and decoders are different, the L nodes 605 could refer to 
dedicated decoder hardware. 

[00132] The communication between nodes firom the MMSE symbol estimation 
processing stage to the nodes for the decoding stage of the algorithm is illustrated by the 
following example. For illustrative purposes, let the number of nodes equal the number of 
observation intervals, Z = M . This simplification is not an architecture restriction, but 
simply a convenient way to illustrate the communication between nodes. Based on this 
simplification, the output of each node 600 in the symbol estimation section is a Ky.\ 
vector representing symbol estimates for tiie K users. Each node 605 in the decoder 
section is responsible for decoding a subset of the number of users. To fiirther sunplify the 
discussion, let each node 605 in the decoder section execute two decoders. La other words. 
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each node 605 performs the processing for two processors of the bank of decoders 532, 
534, 536, 538 from Figure 5 so that the number of nodes L in this example would be one 
half the number of users KiL-Vi K). 

[00133] Therefore, based on these simplifications, for purposes of explanation only, the 
symbols from the symbol estimation nodes 600 that correspond to users 1 and 2 in the first 
observation interval are two of the K symbol estimates determined in Node 1. The 
symbols firom Node 1 of the symbol estimation section 600 are transmitted to the Node 1 
of the decoder section nodes 605, wherein Node 1 of tihie decoders nodes 605 represents the 
decoding of users 1 and 2 for the first time interval. The symbols for the users 1 and 2 in 
the second observation interval. Node 2, fi:om the symbol estimations section 600 are 
passed over the high-speed mterconnect 625 to Node 1 in the decoder section 605. 
Similarly, the symbols for the same two users, users 1 and 2, in the third observation 
interval. Node 3, from the symbol estimation section 600 are passed via 625 to Node 1 in 
the decoder section 605. This procedure continues for the M observation intervals for the 
symbols corresponding to the first 2 users, user 1 and user 2. 

[00134] As should be readily apparent firom this example. Node 2 of the decoder section 
605 processes ujser 3 arid user 4 for all the Z observation nodes 600; Node 3 of the decoder 
section 605 processes user' 5 and user 6 for all the Z observation nodes. By way of fiirther 
explanation. Node 4 in the decoder section 605 is responsible for decoding users 7 and 8, 
and receives two symbols corresponding to users 7 and 8 from all the Z observation nodes 
600. For example. Node 1 of the symbol estimation section 600 passes the symbol 
estimates of users 7 and 8 to Node 4 of decoder section 605. Similarly, Node 2 of the 
symbol estimation section 600 passes the symbol estimates for users 7 and 8 for the 2nd 
observation interval to Node 4 of decoder section 605. This procedure continues for all L 
nodes in the decoder section 605. Therefore, symbol estimates out of the MMSE symbol 
estimators 600 are routed from the Af observation intervals to the isT decoders of the 
decoder section 605 based on the mapping of the iT decoders to the L available processing 
nodes. 

[00135] The next section is the amplitude estimation section 610 for nodes 1 - Z. The 
amplitude estimation section 610 represents L nodes implementing the MMSE stage of 
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the amplitude estimator, as shown by modvdes 560, 562, 564, 566 in Figure 5. Prior to 
implementing amplitude estimation, the amplitude estimation nodes 610 receives all 
symbol likelihoods via the high speed intercoimect 630 following the execution of the 
nodes from the decoders 605. As described in Figure 5, the likelihoods of the symbols out 
of the K decoders 532, 534, 536, 538 are routed back to M MMSE filters 560, 562, 564, 
566 for the amplitude estimation stage of the algorithm. For this example, two decoders 
are implemented per node in the decoder section 605 and one MMSE filter processor in the 
amplitude estimation section 610 is executed per node. The output of Node 1 in the 
decoder section is a Jl/x 2 array of likelihoods corresponding to the M symbol likelihoods 
for the first two users after implementing the decoder processing 605, descrambler, de- 
interleaver, and rate identification. Node 1 in the amplitude estimation section 610 retains 
the symbol likelihoods for the first two users, user 1 and user 2, in the first observation 
interval. Node 2 in the amplitude estimation section 610 is responsible for the MMSE . 
implementation over the second observation interval. Therefore, the second set of bits in 
. the M X 2 array* is passed from Node 1 in the decoder section 605 to Node 2 in the . , 
. amplitude estimation section 610. This is repeated for all M observation intervals. . 
Therefore, the outputs of the decoder section 605 are passed to all nodes 610 computing 
the amplitude updatir^ vector in the A/ observation intervals, 

[00136] The next set of nodes is the amplitude weighing section 615 responsible for 
updating the amplitude estimate and which implements the functionality of the amplitude 
estimator 575 of Figure 5. This processing is performed at a single node, and for 
illustrative purposes Node 3 is depicted and described herein. It shoxild be noted that the 
• other nodes of this section 615 of the algorithm remain idle while the amplitude estimation 
is completed. Therefore the high-speed interconnect 635 passes the ampUtude updating 
vector and quadratics from the L nodes in the amplitude estimation section 610 Node 3 of 
the amplitude weighing section 615. 

. [00137] The functionality of the Amplitude Estimator 435 of Figure 4 provides a more 
detailed explanation concerning the weighing implementation. The amplitude estimate 
calculated by Node 3 of the amplitude weighing section 615 is "fanned-ouf to all the L 
processing nodes for further processing of symbol estimation at the next iteration (I+l). 
Thus, the new amplitude estimate for all K users is distributed via the high speed 
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interconnect 640 to all the Z nodes of the next iteration symbol esthnation section 620 for 
the I+l iteration. The L nodes in the next iteration symbol estimation section 620 re- 
compute the symbol estimates, using the new amplitude estimates, for the K users in the 
M observation intervals. The stages and data routing are repeated until the maximum 
number of iterations is achieved or the desired bit error rate is reached. 
[00138] One aspect of the present invention is that it solves the previously described 
problem of tightly integrating parameter estimation, symbol hypothesis testing, decoding, 
and rate identification. The approach in one embodiment assumes a modulation scheme 
(e.g. CDMA) has been utilized. The solution presented herein in one embodiment is a 
windowed-based Minimum Mean Squared Error (MMSE) approach to Turbo-decoding for 
joint signal demodulation, which is easily extended to time-varying channels. The 
invention is based on an iterative decoding solution that exploits error correction codes. In 
addition, the approach of the present invention is suitable for symbol asynchronous as well 
as symbol synchronous. Also, the algorithm used is suitable for a variety of signaling 
schemes such as M-ary Phase Shift Keying (MPSK). 

[00139] "^It will be appreciated that impfementation of the present invention solyjes the 
problem of the imknown chaimel parameters in state of the art detectors. The present 
mvention works with many varieties of multiuser detectors and is easily implemented into 
dififerent multiuser detectuig systems. Specifically, the MMSE symbol estimators in 
modules 600 and 620 of Figure 6 can be substituted with any of the known or referenced 
MUD algorithms. 

[00140] The present invention dramatically reduces the number of computations needed 
in a blind approach to joint signal demodulation and without pilot tones or training 
sequences, so that reliable operation can be achieved in a real-time implementation. It will 
also be appreciated that since the present technique does not require knowledge of training 
sequences or existence of a pilot signal, it is considered a blind approach to symbol and 
data rate estimation. The efficient implementation of blind estimation is realized because 
of the high degree of parallelism. This provides an eflBcient implementation because of 
reduced number of operations associated with updating MMSE filter tap weights. 
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[00141] Further efiBciency is realized because the number of iterations required to achieve 
a certain bit-error-rate for case of unknown amplitudes is typically similar to the case when 
the amplitudes are completely known. The present efficiency occurs because the 
degradation in bit-error-rate for the case of unknown amplitudes is manageable relative to 
the case of known amplitudes. Thus, the present approach represents a new method to 
estimate channel amplitudes and extension to time-varying channels is obvious to those in 
the art. The present approach also represents a new method to reduce biases in ampUtude 
estimates based on incoherent integration. 

[00142] The foregoing description of the embodiments of the invention has been 
presented for Ihe purposes of illustration and description. It is not intended to be 
exhaustive or to limit the invention to the precise form disclosed. Many modifications and 
variations are possible in light of this disclosure. It is intended that the scope of the 
invention be limited not by this detailed description, but rather by the claims appended 
hereto. 
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What is claimed is: 



1 1. An apparatus for processing a digital data stream firom multiple users, 

2 comprising: 

3 an initial amplitude estimation unit processing said data stream and 

4 producing initial amplitude estimates on a first iteration; 

5 a joint amplitude estimator coupled to said data stream and said initial 

6 amplitude estimator, wherein said joint amplitude estimator produces 

7 updated ampUtude estimates; 

8 a symbol estimator coupled to said data stream, said initial amplitude 
. 9 estimator, and said joint amplitude estimator, wherein said symbol estimator 

10 produces a plurality of symbols estimates for each user; and 

11 a bank of decoders coupled to said synibol estimator, producing a plurality . 

12 of symbol likelihood estimates for each user, wherein said symbol 
^13 likelihood estimates are iteratively fed back to said symbol estimator and 

14 saidjoint amplitude estimator until a final condition is obtained. 
15 

1 2. The apparatus according to claim 1, wherein said final condition is selected 

2 . . from at least one of the group consisting of: bit error rate metric level and 

3 fixedniraiber of iterations. 

4 . 

1 3. The apparatus according to claim 1, wherein said symbol hypothesis testing 

2 module is a member selected from at least one member of the group 

3 consisting of: Minimum Mean Squared Error (MMSE), maximum 

4 likelihood, M-algorithm, T-algorithm, and Q-algorithm, decorrelating 

5 decision-feedback detector (DDFD), improved decorrelating decision- 

6 feedback detector (IDDFD), successive interference cancellation (SIC), 

7 parallel interference cancellation (PIC) and multi-stage detector; block- 

8 iterative interference cancellation, and a deferred decorrelating decision- 

9 feedback detector. 

41 



CT/US2003/038190 



CLAIMS 



wo 2004/095713 ^9CT/US2003/038190 



The apparatus according to claim 1, further comprising an energy detector 
coupled to said joint amplitude estimator. 



10 

1 4. 

2 
3 

1 5. The apparatus according to claim 1, wherein said bank of decoders are 

2 decoders selected from at least one decoder of the group consisting of: Soft- 

3 output Viterbi , Maximum A Posteriori , and BCJR. 
4 

1 6. A method for providing initial amplitude estimation for a plurality of user 

2 channels^ comprising: 

3 separating said user channels into active charmels and inactive channels; 

4 processing said inactive chaimels according to the sub-steps comprising: 

5 applying a bank of filters to said inactive channels for each inactive 

6 ^user; ' 

7 squaring an output from said bai^k of filters; 

8 summing an output from said squaring operation; and 

9 calculating an average bias estimate from an output of said 

10 summing; and 

11 processing said active chamiels according to the sub-steps comprising: 

12 applying a bank of filters to said user data of said active channels; 

13 squaring an output from said bank of filters ; 

14 sunnning an output from said squaring; and 

15 removing said average bias estimate fix>m an output from said 

16 simmaing. 
17 

1 7. The method according to claim 6, said processing said inactive channels 

2 further comprising dividing said inactive channels into groups and 

3 calculating at least one group average bias estimate. 
4 
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1 8. The method accordmg to claim 7, said processing said active chamiels 

2 further comprising dividing said active chamels into groups and 

3 respectively removing said at least one group average bias estimate. 
4 

1 9. The method according to claim 6, wherein said summers further comprises 

2 scaling. 
3 

1 10. The apparatus according to claim 6, wherein said separating said active 

2 channel j&om said inactive channels is performed by an energy detector. 
3 

1 1 1. An apparatus for processing initial amplitude estimates from a data stream 

2 of multiple users, comprising: 

3 an energy detector measuring an energy level of said data stream, wherein 

4 said energy detector separates a plurality of active channels from a plurality of 

5 inactive chaimels; 

6 a bank of filters coupled to said inactive channels; 

7 a bank of squaring operators coupled to said bank of filters; 

8 . a bank of inactive summers coupled to said squaring operators; 

9 an average bias estimator coupled to said inactive summers calculating an 

10 average bias estimate; 

11 . a bank of filters coupled to said active channels; 

12 a bank of squaring operators coupled to said bank of filters; 

13 a bank of active summers coupled to said squaring operators; and 

14 a bias removal section coupled to said active summers and said average bias 

15 estimator. 
16 

1 12. The apparatus according to claim 11, wherein said inactive channels are 

2 divided into groups calculating said average bias estimate for each said 

3 group. 
4 
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1 13. The apparatus according to claim 12, wherein said active channels are 

2 divided into said groups and wherein said bias removal section for each said 

3 group removes said average bias estimate for each said corresponding 

4 group. 

5 

1 14. A joint amplitude estimator for a data stream from multiple users, 

2 comprising: 

3 a data stream from said multiple users divided into a plurality of 

4 observation intervals; 

5 a. plurality of processing modules coupled to said observation intervals, 

6 wherein said processing modules calculates interference cancellation values 

7 for each of said users and computes a filter for each of said observation 

8 intervals, said filter being applied to said data within said observation 

9 interval to compute individual amplitude estimates; and 

10 an amplitude estimation umt wMch processes said individiial amplitude 

1 1 estimates and calculates new amplitude estimates, wherein said new 

12 amplitude estimates are iteratively passed back to said processing modules 

13 until a final condition is obtained. 
14 

1 15. The joint amplitude estimator according to claim 14, wherein said final 

2 condition is selected from at least one of the group consisting of: bit error 

3 rate metric level and fixed number of iterations. 
4 

1 16. The joint amplitude estimator according to claim 14, wherein said 

2 amplitude estimation unit sums and weighs said individual estimates. 
3 

1 17. The joint amplitude estimator according to claim 14, wherein said 

2 observation intervals are based on distinguishing attributes selected from at 

3 least one of the group consisting of: time, code, and frequency. 
4 
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1 18. The joint amplitude estimator according to claim 14, wherein said filter is 

2 calculated for an arbitrary observation interval m as: 

4 

1 19. A method for processing amplitude estimates for a multiuser data stream 

2 divided into a plurality of observation intervals, comprising: 

3 computing a filter for each said observation interval of said data stream; 

4 applying interference cancellation to said data stream for each said 

5 observation interval; 

6 applying said filter to said data stream for each said observation interval 

7 from said interference cancellation to produce individual amplitude 

8 estimates for each said observation interval; 

9 1^ computing ne^ amplitude estimates Using said individual amplitude 
10 , - estimates; and 

U . , passing said new mnplitude estimates back to said filter for iterative 

12 processing until a final condition is obtained. ' « 

13 

1 20. The method according , to . claim 19, wherein said computing fiirther 

2 comprises summing said individual amplitude estimates and weighing. 

3 , 

1 21. The method according to claim 19, wherein said final condition is selected 

2 from at least from the group consisting of: bit error rate metric level and 

3 fixed number of iterations. 

4 . 

1 22. A turbo-decoding system for joint signal demodulation, comprising: 

2 a data stream from a plurality of xisers divided into a plurality of observation 

3 intervals; 
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4 a plurality of symbol processing nodes processing said observation intervals 

5 to compute symbol estimates for said data stream within said observation 

6 interval; 

7 a plurality of decoder nodes processing said symbol estimates and 

8 producing a plurality of symbol likelihoods; 

9 a plurality of amplitude update nodes processing said symbol likelihoods 

10 and calculating a plurality of amplitude update vectors; and 

11 an amplitude estimator node processing said amplitude update vectors and 

12 producing an amplitude estimate update, wherein said amplitude estimate 

13 update is passed back to said processing nodes for iterative processing 

14 between said symbol processing nodes, said decoder nodes, said amplitude 

15 update nodes, and said amplitude estimator node until a final condition is 

16 obtained. 

17 

1 23. The systegfi according to claim 22, further comprising a plurality of memory 

.2 y i^iits, wherein at least one memory unit is coupled to each of said 

3 processing nodes and to each of said amplitude estimation nodes. 
4 

1 24. The system according to claim 22, further comprising a first system area 

2 . network (SAN) coupling said processing nodes and said decoder nodes, a 

3 second system area network (SAN) coupling said decoder nodes and said 

4 amplitude estimation nodes, a third system area network (SAN) coupling 

5 said amplitude estimation nodes and said amplitude estimator, and a fourth 

6 system area network (SAN) coupling said amplitude estimator and said 

7 processing nodes. 
8 

1 25. The system according to claim 22, wherein said final condition is selected 

2 ftom at least one of the group consisting of: bit error rate metric level and 

3 fixed number of iterations. 
4 
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26. The system according to claim 22, wherein said processing nodes, said 
decoder nodes and said amplitude estimation nodes are one set of nodes. 

27. The system according to claim 26, wherein said one set of nodes are each 
comprised of at least one processor. 

28. The system according to claim 22, wherein said likelihood symbols from 
said decoder nodes are accessible at any time for post-processing. 

29. The system according to claim 22, wherein said processing nodes further 
comprises a thresholder to convert said symbol estimates to symbol bits. 

30. The system according to claim 22, wherein said decoder nodes performs de- 
interleaving, rate identification, de-pmicturing and de-scrambling. 
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